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ABSTRACT 

Changes  in  lighting  condition  impact  the  appearance  of  face  images  to  a  large  extent.  Therefore  it  becomes  very 
difficult  to  recognize  a  face  under  different  lighting  conditions.  The  face  images  have  to  be  normalized  to  overcome  the 
illumination  variation  to  get  recognized.  The  work  presented  here  analyzes  the  performance  of  the  illumination 
normalization  methods  of  face  images  like  DCT  (Discrete  Cosine  Transform)  normalization,  Wavelet  Denoising,  Gradient 
Faces,  Local  Contrast  Enhancement,  and  Weber's  law  under  different  lighting  conditions.  The  face  images  are 
preprocessed  and  normalized  by  each  normalizing  method  to  reduce  the  effect  of  illumination.  To  analyze  the  performance 
of  each  method,  the  features  of  these  preprocessed  images  are  extracted  using  Principal  Component  Analysis  (PCA)  and 
the  recognition  is  done  based  on  Euclidean  Distance.  In  this  paper,  the  advantages  and  drawbacks  of  each  method  are 
analyzed.  The  recognition  rate  and  computational  time  of  these  methods  are  compared.  The  Extended  Yale  B  database  is 
used  in  this  work. 

KEYWORDS:  Discrete  Cosine  Transform,  Gradient  Domain,  Local  Contrast  Enhancement,  Principal  Component 
Analysis,  Wavelet  Denoising,  Weber's  Law 

INTRODUCTION 

Automatic  face  recognition  has  a  wide  scope  in  applications  like  authentication,  security,  mug  shot  data  base 
matching  and  surveillance.  The  lighting  condition  changes  from  day  to  night  and  also  between  indoor  and  outdoor 
environments.  These  variations  severely  affect  the  appearance  of  the  face  [l]-[2].  The  potential  change  caused  by  intra 
personal  variation  (variation  within  the  same  class)  is  much  larger  compared  to  that  of  inter-personal  variations  (variation 
between  classes)  [1].  The  3  D  face  structure  when  observed  under  direct  lighting  source  could  produce  a  shadow  that 
would  highlight  or  deter  some  facial  features.  As  a  result  the  face  recognition  tends  to  fail  if  the  test  image  has  a  different 
lighting  condition  than  that  of  the  training  images.  The  methods  to  solve  the  illumination  problem  can  be  generally 
classified  into  three  categories  as  discussed  below.  [12]. 

Face  Image  Enhancement 

This  method  eliminates  the  intensity  variations  caused  by  the  various  lighting  conditions.  It  uses  preprocessing 
techniques  to  normalize  the  images  under  different  illumination  conditions.  For  instance,  global  and  local  contrast 
enhancement  techniques  like  histogram  equalization,  region-based  histogram  equalization,  block-based  histogram 
equalization,  and  logarithmic  transforms  have  been  widely  used  for  illumination  normalization  [3]-[7]. 

Invariant  Feature  Extraction 

This  method  considers  stable  facial  features  that  are  not  affected  by  the  lighting  changes.  For  example,  Edge 
maps,  Local  binary  pattern,  Linear  Discriminant  Analysis  (LDA)  considers  features  which  are  not  sensitive  to  illumination 
variations.  Statistical  methods  for  feature  extraction  largely  rely  on  the  representativeness  of  the  training  samples  [8]. 
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Face  Modeling 

The  illumination  at  different  angles  causes  shadow  in  the  face  image  due  to  the  3-D  structure  of  human  face.  Face 
images  with  fixed  pose  under  varying  illumination  from  different  directions  form  a  convex  cone  called  illumination  cone. 
It  is  represented  by  a  low-dimensional  linear  subspace  whose  basis  vectors  are  estimated  from  training  images.  It  requires 
many  training  images  under  different  illumination  conditions  [9]-[10]. 

In  this  work,  face  image  enhancement  has  been  chosen  to  solve  the  effect  of  illumination  because  it  is  simple  to 
implement  and  it  does  not  require  a  model  to  represent  the  training  images.  The  rest  of  the  paper  is  organized  as  follows:  In 
section  2,  the  detailed  theoretical  background  of  the  illumination  normalization  methods  are  given.  In  section  3,  the 
methodology  of  illumination  normalization  and  the  different  stages  of  the  experiment  are  described.  In  section  4,  the 
experimental  results  are  presented  and  their  performance  is  compared.  Conclusion  is  given  in  section  5. 

ILLUMINATION  NORMALIZATION  METHODS 

The  face  image  I(x,  y)  is  considered  as  the  product  of  the  reflectance  component  R(x,  y)  and  the  illumination 
component  L(x,  y)  [11]. 

/Gr.y)  =  R(x,y)L(x,y)  ^ 

It  is  difficult  to  separate  the  reflectance  and  the  luminance  components  from  face  images.  Therefore,  it  is  assumed 
that  luminance  L  varies  slowly  (low  frequency  component)  and  reflectance  R  varies  abruptly  (high  frequency  component) 
between  the  adjacent  pixels  in  order  to  solve  the  problem. 

DCT  Normalization 

Discrete  Cosine  Transform  converts  the  image  from  spatial  domain  to  frequency  domain.  DCT  is  performed  on 
the  facial  image  to  obtain  all  frequency  components  of  the  image.  Illumination  components  of  the  image  can  be  removed 
by  setting  low  frequency  components  to  zero  [12]. 

Taking  logarithm  transform  on  (1),  we  have 

iog  I(x,y}  -  fojffCxjO  +  lagL(x,y}  (2) 

From  (2),  if  the  incident  illumination  is  L(x,  y)  and  the  desired  uniform  illumination  is  l'(l'  is  identical  for  every 
pixel  of  an  image),  we  have 

logl'Qcy)  =  logR(x,y}  +  log  V 

=  logRQx.y)  +  log  L{x,  y}  —  e{x,y) 

=  taglQc.y)  -  sQx*y)  (3) 
where  Fk,>0  =  !aS  £(*>>0  ~  l°8 

I'(x,  y)  is  pixel  value  under  desired  illumination.  From  (3),  the  normalized  face  image  is  obtained  by  subtracting 
the  compensation  term  e  (x,  y)  from  the  original  image.  The  compensation  term  e  (x,  y)  is  the  difference  between  the 
normalized  illumination  and  the  estimated  original  illumination  in  the  logarithm  domain.  To  remove  the  low  frequency 
components,  the  product  of  DCT  basis  image  and  corresponding  coefficient  are  subtracted  from  the  original  image.  To  set 
'n'  DCT  coefficients  to  zero, 
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I'(x,  y)  is  the  desired  normalized  face  image  in  the  logarithm  domain.  Therefore,  discarding  low-frequency  DCT 
coefficients  in  the  logarithm  domain  is  equivalent  to  compensating  for  illumination  variations. 

The  first  DCT  coefficient  (i.e.,  the  DC  component)  determines  the  overall  illumination  of  a  face  image. 

where  C  (0,  0)  is  the  DC  coefficient  of  the  logarithm  image.  ^  is  the  average  gray  level  intensity  of  the  face  image. 
Since  low  frequency  features  are  discarded,  unstable  facial  feature  hair  is  not  considered  and  it  reduces  computational 
complexity.  The  main  issue  of  this  method  is  the  choice  of  number  of  DCT  coefficients  to  be  considered.  Illumination 
variations  and  facial  features  are  not  perfectly  separated  with  respect  to  the  frequency  components;  therefore  some  facial 
features  will  be  sacrificed. 

Wavelet  Denoising 

Wavelet  transform  decomposes  the  image  into  different  frequency  sub  bands  at  multi-scale.  This  method  extracts 
illumination  invariant  component  R  from  (1)  by  performing  logarithm  operator  on  the  image. 

■  =  K  +  V  (4) 
where  F  ~  log  (I),  R'  ~  log(R),  and  V  ~  log  (L). 

Given  face  images  of  a  person  under  different  lighting  conditions,  the  frequency  corresponding  to  R  (key  facial 
features)  is  different.  To  eliminate  the  illumination  variation,  an  estimate  for  key  facial  features  R  with  small  mean-square 
error  is  given  by 

min  It'2  =min(iJ  -  L'}s 

R'  is  obtained  by  solving  optimization  problem  (5)  such  that  key  facial  features  are  robust  to  different  lighting. 
The  facial  features  R  are  equivalent  to  "noise"  in  denoising  model,  therefore  key  facial  features  R  is  extracted  from  multi- 
scale  space.  Low  frequency  wavelet  coefficients  which  are  highly  sensitive  to  illumination  variation  have  been  discarded. 
Instead  of  directly  estimating  R'  by  (5),  L  is  estimated  by  wavelet  denoising  model,  and  then  K  is  estimate  d  by  (4)  [13]. 

The  image  is  decomposed  into  four  sub  bands  and  sub  sampled  using  2D  discrete  wavelet  transform.  The  wavelet 
coefficients  of  the  signal  L'  denoted  by  X,  is  obtained  from  the  wavelet  coefficients  of  the  logarithm  image  data  P  by 
suppressing  the  signal  R'  with  a  nonlinear  thresholding  function  thresh  [14]  . 
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threshfjx)  = 


X  -  T  if  X  >  T 
X-  7  i:X  <  -7 

0  if  |X  <  r 


The  threshold  Tcan  be  calculated  as  follows 

r  =  - 


■X 


e  =  ■ 


Yii  E  subband  HHL 


?l2i-lL.j=l.  U 


In  general,  the  range  of  X  is  from  0.01  to  0.30(X=0. 1  for  implementation). 

Wavelet  transform  has  the  advantage  of  multi-scale  analysis,  better  edge-preserving  ability  in  low  frequency 
illumination  fields,  fast  computation  and  simpler  parameter  selection.  The  main  drawback  is  that  the  recognition  rate 
depends  on  the  selection  of  X. 

Gradient  Faces 

This  method  converts  the  image  from  pixel  domain  to  gradient  domain.  Gradient  domain  considers  the 
relationship  between  the  neighboring  pixel  and  it  reveals  underlying  inherent  structure  of  the  image.  [15] 

Image  gradients  extract  illumination  insensitive  features  like  edge  from  the  facial  image.  Given  an  image  I(x,  y) 

the  ratio  of  y  gradient  ^  ^         ^/  %  to  x  gradient  ^  ^  3  ^  F>/      js  computed  as  follows. 

From  (1) 

I(x  —  iAx,  y  —  jAy)  =  R(x  -  iAx,  y  -jAy}  l(x  -  iAx.  y  -  jAy  }  ^ 
Subtracting  (1)  from  (6) 

l(x  —  iAx,  y  —  jAy)  -  l(x.y)     =  JSGr  —  iAx.  y  -  jAy)L{x  -  iAx,  y  —  jAy}  -  R(x,  y) I  {x,  y) 
Based  on  the  assumption,  L  is  approximately  smooth 

/Or  -  iAx.  y  -  jAy)  -  l(x,y)  as  R(x  -  iAx..  y  -  jAy)lXx.  y)  -  Rfx,  yiUbc, y) 

■x  (ft(x  -  iAx.y  -  jAy)  -  R(x,y"))L(x,y)  ^ 

Taking  the  X  gradient  of  (7) 

is         1   -     3s  (8) 
Similarly  we  have 

slews    t  ,  anptn 

Dividing  (9)  by  (8), 
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3I(x,y]  ~  3R(x,v) 

The  ratio  of  y-gradient  to  x  gradient  will  be  infinity  when  x-gradient  is  zero.  Thus  the  ratio  cannot  be  directly  used 
as  illumination  insensitive  measure.  Gradient  face  (GF)  of  an  image  I(x,  y)  is  defined  as 

g/Cc,  y) 
dv 

GFfay}  =  arctcm  (  g/(^  y)  ) 
Sac 

Before  computing  the  gradient,  the  image  is  smoothened  by  convolving  with  the  Gaussian  kernel  function  so  that 
it  reduces  the  noise  and  shadows  and  makes  gradients  more  stable.  But,  first  order  derivatives  are  little  sensitive  to 
lumination. 

Local  Contrast  Enhancement  (LCE) 

LCE  increases  the  visual  contrast  of  the  image  in  a  designated  intensity  range.  The  contrast  of  a  gray  scale  image 
is  enhanced  by  representing  the  image  data  as  the  local  contrast  for  each  pixel  in  logarithm  domain.  LCE  is  mainly  useful 
for  improving  the  contrast  of  the  image  [16].  The  local  contrast  5  (x,  y)  for  a  pixel  (x,  y)  with  luminance  value  I(x,  y) 

*  0  otherwise 

where  9  is  a  predefined  threshold  (9=1  for  implementation).  I(x,  y)  denotes  the  average  luminance  value  of  the 
neighborhoods  of  the  pixel  (x,  y),  and  the  neighborhood  window  size  of  a  pixel  adopted  in  the  current  implementation  is 
5x5  pixels  as 

Instead  of  using  the  original  intensity  value  of  the  pixel,  local  contrast  value  is  used.  Local  contrast  is  defined  as 
the  ratio  of  the  original  intensity  of  the  pixel  to  the  average  of  its  surrounding  pixels,  in  logarithm  domain.  The  local 
contrast  value  measured  may  be  positive  or  negative.  Therefore  the  data  has  to  be  normalized.  The  local  contrast  value 
for  a  pixel  (m,  n)  is  normalized  by 

f(x,  y)  =  cp  x  (5(x,  y)  -  8^/  (5max-  8^ 

where  f(x,y)  denotes  the  normalized  local  contrast  value  of  a  pixel  (x,  y),  cp  denotes  the  maximum  gray  level  in 
the  image  data  range  and  5  max  and  8  min  represent  the  maximum  and  minimum  local  contrast  values  of  all  pixels.,  LCE 
can  be  used  when  there  are  significant  variations  among  the  images,  and  even  within  the  single  image.  The  disadvantage  is 
that  the  recognition  rate  depends  on  the  choice  of  window  size. 

Weber  Faces 

Ernst  Weber  in  1983  proposed  that  the  ratio  between  the  smallest  percentage  change  in  the  stimulus  (Almin)  and 
the  background  level  is  a  constant 

— : —  =  tf 
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Where  K  is  the  Weber  fraction  which  remains  constant  despite  of  the  variation  in  the  I  term.  Given  the  face 
image,  for  each  pixel,  two  terms  are  computed:  relative  intensity  difference  of  the  current  pixel  against  its  neighbor, 
intensity  of  the  current  pixel.  The  obtained  ratio  image  is  called  Weber  Face  [17]. 

The  ratio  image  is  computed  by  Weber  local  description  (WLD)  proposed  by  Chen-et  al  [18]  consists  of  two 
components  differential  excitation  and  orientation.  By  applying  WLD  to  the  face  image  Weber  Face  is  obtained. 

IFFfry)  =  arctan(alisAljsA  —  J  (10) 

where  A=  {-1,0,  1} 

The  arc  tangent  prevents  the  output  magnitude  being  large  and  partially  suppresses  the  side  effect  of  noise.  Ic  is 

the  intensity  of  the  current  pixel.  I;  denotes  the  intensities  of  neighbors  (i=0,  1,  JP-l)  (P=8  here). a  is  the  adjusting 

parameter  for  the  intensity  difference  between  the  neighboring  pixels(a=2  used  for  implementation. 

From  (1)  we  have 

Its  —  lAx, y  —  /dy)  =  JT(r  -  iAx, y  - ;Ay)  L{x  —  i&x, y  -  J Ay)  ^\) 
Since  illumination  component  varies  slowly  except  shadow  boundaries 

L(x  -  iAx,  y  -  /dy)  *  £(*.y)  (12) 
By  substituting  (1),(10),(1 1)  into  (10) 


WF(x/y)  =  ar<it2.a  ,-EA^iEA  a<aroo 


( 

Here  the  Laplace  operator  is  used  to  measure  local  intensity  variation  and  edge  detection.  It  is  sensitive  to  noise; 
the  image  is  smoothened  using  Gaussian  filter.  Thus  it  reduces  the  side  effect  of  shadow  boundaries.  Weber  face  preserves 
the  most  important  facial  features.  Weber's  law  breaks  down  for  very  bright  and  very  dark  luminance  level. 

METHODOLOGY 
Illumination  Normalization 

Illumination  normalization  methods  preserve  the  facial  features  and  minimize  the  undesired  artifacts.  In  this  work, 
extended  Yale  B  face  database  is  used.  This  database  consists  of  2,470  front-face  images  of  38  individuals  with  65 
illumination  conditions  [10].  The  experiments  were  performed  using  30  subjects  with  16  sample  images  for  each  subject 

under  the  illumination  condition  of  0-70  . 

These  images  are  normalized  using  the  techniques  namely  DCT  normalization,  Wavelet  Denoising,  Gradient 
Faces,  Local  Contrast  Enhancement  and  Weber  Faces.  Figure  1  shows  the  images  after  illumination  normalization  by 
different  methods. 


i    ~  I 


Figure  la:  Images  under  Different  Illumination  from  Extended  Yale  B  Database 
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Figure  lb:  Results  of  DCT  Normalization 


Figure  lc:  Results  of  Wavelet  Denoising 


Figure  Id:  Gradient  Faces 


Figure  le:  Results  of  Local  Contrast  Enhancement 


Figure  If:  Weber  Faces 

Recognition 

The  different  stages  of  illumination  invariant  face  recognition  system  are  shown  in  Figure  2. 


Illumination 

Feature  Extraction 

Identification 

Normalization  (by  one 

by  Principal 

by  Euclidean 

of  above  mentioned 

Component 

distance 

methods) 

Analysis 

Input 
image 

Figure  2:  Block  Diagram  of  Illumination  Invariant  Face  Recognition  System 

The  performance  of  these  illumination  normalization  methods  are  tested  by  evaluating  the  recognition  rate  in  a 
face  recognition  system. 

First  the  face  images  are  cropped  to  the  size  192  x  168.  The  images  are  then  normalized  using  one  of  above 
techniques  with  MATLAB  7.10  in  a  Pentium  dual  core  PC  system  running  at  2GHz.  Of  the  16  normalized  images  per 
subject,  8  images  are  selected  for  training  and  the  remaining  8  images  for  testing.  DCT  normalization  transforms  the  image 
to  frequency  domain.  And  Wavelet  denoising  converts  the  pixel  values  to  wavelet  coefficients.  Gradient  method  converts 
the  image  from  pixel  domain  to  gradient  domain.  LCE  retains  the  image  in  pixel  domain.  Weber's  law  transforms  the  pixel 
values  to  second  derivative  and  reduces  the  illumination  sensitive  components. 
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The  face  image  is  considered  as  a  very  high  dimensional  space  and  each  pixel  value  corresponds  to  its  component. 
The  dimension  can  be  reduced  by  selecting  the  principal  components  of  the  face  image.  These  principal  components 
inherently  represent  the  face  images  in  lower  dimension.  The  recognition  of  the  test  image  is  based  on  selecting  the  class 
to  which  it  has  the  minimum  Euclidean  distance.  Similarly  the  recognition  rate  is  found  with  images  that  have  been 
normalized  with  the  other  illumination  normalization  techniques.  The  recognition  rate  obtained  for  the  different 
illumination  normalization  methods  is  shown  in  Table  1 . 

Table  1:  Comparison  of  Recognition  Performance  with  5  Different  Methods 


Methods 

No.  of 
Test 
Images 

No.  of 
Recognized 
Images 

Recognition 
Rate  (%) 

DCT 

240 

140 

70.8 

Wavelet 

240 

138 

57.5 

Gradient  face 

240 

222 

92.5 

LCE 

240 

94 

39.2 

Weber  faces 

240 

236 

98.3 

RESULTS  AND  DISCUSSIONS 

Figure  3  shows  the  performance  comparison  of  illumination  normalization  techniques.  From  the  results,  it  is 
evident  that  illumination  normalization  using  Weber's  law  gives  better  recognition  rate  when  compared  to  other 
techniques. 

Figure  la  shows  the  sample  of  training  images  of  class  1  under  different  illumination  conditions  from  extended 
Yale  database.  Figure  lb  shows  the  DCT  normalized  facial  images.  The  recognition  rate  is  lower  because  while  discarding 
low  frequency  coefficients  some  facial  features  are  also  eliminated.  It  is  difficult  to  decide  the  appropriate  number  of  DCT 
coefficients  to  be  considered. 

When  the  lighting  direction  changes  shadows  are  produced  in  the  facial  image.  The  shadows  and  the  facial  feature 
in  the  shadow  region  lie  in  the  same  frequency  band.  Therefore  when  the  illumination  component  is  discarded,  the  facial 
features  are  also  lost.  Elimination  of  these  low  frequency  coefficients  reduces  the  computational  complexity  but  reduces 
the  recognition  rate. 

RecognitionRate 

100  n 


■  Recognition  Rate 


DCT      Wavelet   Gradient      LCE  Weber 

Figure  3:  Recognition  Rate  Vs  Illumination  Normalization  Methods 

Figure  lc  shows  the  images  after  wavelet  denoising.  This  method  is  fast  since  it  analyzes  the  image  in  multi-scale 
and  it  eliminates  the  low  frequency  coefficients.  The  recognition  rate  depends  on  the  parameter  X.  It  is  difficult  to  select  the 
appropriate  value  for  X  since  it  depends  on  the  illumination  condition. 
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Figure  Id  shows  the  gradient  faces.  Taking  gradient  in  the  face  image  provides  edge  strength  and  it  points  in  the 
direction  of  greatest  rate  of  change  of  intensity.  Gradient  faces  gives  a  good  performance,  but  first  derivatives  are  little 
sensitive  to  illumination.  Change  in  lighting  direction  changes  the  magnitude  of  gradient  greatly. 

Figure  le  shows  the  image  after  local  contrast  enhancement.  LCE  avoids  the  changes  caused  in  the  face  image 
due  to  illumination.  Even  though  LCE  produces  features  which  are  less  sensitive  to  illumination,  it  also  produces  undesired 
halos.  Further,  LCE  cannot  distinguish  the  images  of  different  persons  since  the  image  is  considered  in  pixel  domain. 

Weber  faces  shown  in  Figure  If  gives  a  better  recognition  rate  when  compared  with  other  methods  because  Weber 
local  descriptor  is  robust  and  it  can  handle  the  illumination  changes.  These  normalization  methods  reduce  the  effect  of 
illumination,  but  minimize  the  interclass  distance.  Computational  time  for  each  normalization  method  to  recognize  the  test 
image  is  shown  in  Table  2.  Computational  speed  for  Wavelet  Denoising  method  is  faster  when  compared  with  other 
techniques. 

Table  2:  CPU  Time  for  Recognizing  Test  Image 


Time  Taken  for 

Methods 

Recognizing 

DCT 

970 

Wavelet 

690 

Gradient  face 

840 

LCE 

880 

Weber  faces 

940 

CONCLUSIONS 

Various  illumination  normalization  methods  for  face  recognition  have  been  implemented  and  their  performance 
has  been  evaluated.  Weber  faces  appear  to  be  the  best  at  handling  illumination,  with  respect  to  recognition  rate.  Wavelet 
Denoising  technique  is  computationally  faster,  but  the  recognition  rate  is  less.  The  experiment  has  been  performed  by 
considering  the  center  portion  of  the  face.  More  discriminant  features  can  be  obtained  if  entire  face  image  has  been  taken 
into  account.  This  can  help  to  maximize  the  interclass  distance  and  improve  the  recognition  rate. 
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